Improving Transliteration Mining by Integrating Expert Knowledge with Statistical Approaches
نویسندگان
چکیده
منابع مشابه
Improving Transliteration Mining by Integrating Expert Knowledge with Statistical Approaches
This paper contributes a study of methods for integrating human expert knowledge with machine learning approaches for determining phonetic similarity of word pairs. A method is proposed which allows a human to provide a structure for the edit costs that are based around a phonetically-motivated model of phoneme sound groups, and the machine to determine precise values for these costs within two...
متن کاملImproving Cross-Language Information Retrieval by Transliteration Mining and Generation
The retrieval performance of Cross-Language Retrieval (CLIR) systems is a function of the coverage of the translation lexicon used by them. Unfortunately, most translation lexicons do not provide a good coverage of proper nouns and common nouns which are often the most information-bearing terms in a query. As a consequence, many queries cannot be translated without a substantial loss of informa...
متن کاملIntegrating Expert Knowledge and Experience
A major challenge in the field of AI is combining symbolic and statistical techniques. My dissertation work aims to bridge this gap in the domain of real-time strategy games.
متن کاملIntegrating an Unsupervised Transliteration Model into Statistical Machine Translation
We investigate three methods for integrating an unsupervised transliteration model into an end-to-end SMT system. We induce a transliteration model from parallel data and use it to translate OOV words. Our approach is fully unsupervised and language independent. In the methods to integrate transliterations, we observed improvements from 0.23-0.75 (∆ 0.41) BLEU points across 7 language pairs. We...
متن کاملA Statistical Model for Unsupervised and Semi-supervised Transliteration Mining
We propose a novel model to automatically extract transliteration pairs from parallel corpora. Our model is efficient, language pair independent and mines transliteration pairs in a consistent fashion in both unsupervised and semi-supervised settings. We model transliteration mining as an interpolation of transliteration and non-transliteration sub-models. We evaluate on NEWS 2010 shared task d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Applications
سال: 2012
ISSN: 0975-8887
DOI: 10.5120/9373-3821